CDS

Accession Number TCMCG075C23602
gbkey CDS
Protein Id XP_007018290.2
Location complement(join(3545725..3545979,3546089..3546215,3546400..3546516,3546648..3546804,3546921..3547062,3547308..3547398,3547581..3547687,3547763..3547918,3548147..3548317,3548490..3548631,3548876..3548995,3549100..3549219,3549311..3549370,3549482..3549603,3549888..3549929))
Gene LOC18591839
GeneID 18591839
Organism Theobroma cacao

Protein

Length 642aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007018228.2
Definition PREDICTED: probable rhamnogalacturonate lyase B [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description Rhamnogalacturonate lyase
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K18195        [VIEW IN KEGG]
EC 4.2.2.23        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTCATGGCCAGGGGTGCAACTGCTTATTCGAGATCATTATGTGGTAATGGATAATGGAATACTCCAAGTGACAATATCGAGTCCTGACGGAATTGTCACTGGGATACGATATAATGGCATCGACAATGTGCTTGAAGTTCAAGATGAGGAAGTTGAAAGAGGGTACTGGGATCTAGTCTGGAGTAAAACAGGAAGTACAGGAACAACAGGAACATTTGATGTGTTTAAAGGAACAAGTTTTAAGGTTGTTGTAGAAAATGAGGACCAAGTAGAGATCTCATTCACAAGAACATGGGATTTCTCTCTTGAGGGCAACGTTGTTCCCTTAAATTTAGACAAAAGGTTCATAATGCTTCGAAATTCTTCGGGATTCTATTCCTACGCCATCTTTGAACACTTGGGGGAATGGCCTCCATTCAATCTTCCACAAGTCAGAATTGTTTTCAAGCTCAGAAAAGACAAATTTCACTACATGGCCGTAGCAGACAACAGGCAAAGATTCATGCCCCTGCCTGATGACCGGTTACCAGACAGGGGCCAACCTCTGGCAACACCAGAAGCTGTCCTGCTTGTCAATCCTGTGGAGCCAGAGTTCAGAGGAGAGGTGGATGACAAGTACCAGTACTCATCCGAGAACAAAGATCTAAAAGTTCATGGGTGGATATCCTTGAACCCCCCAATTGGTTTCTGGCAAATTACTCCCAGCAGTGAATTCCGGTCAGGTGGACCCGTCAAACAGAACCTGACCTCCCATGTTGGCCCCTATGCTCTTGCAATGTTTCTTAGTGCTCATTATGCCGGAGAGGATTTGGTGCTGAAGCTGAACCCAGGAGAGCCGTGGAAAAAGGTTTTCGGCCCAGTTTTTTTGTATCTGAATTCTCTCTGGGATAAGGACAATGTATATTCCCTTTGGGAGGATGCTAAAGACCAGATGCAGATGGAAGTCCAGAACTGGCCCTACAGTTTTCCAGCATCAGAGGATTTTCCAAAATCAGACCAGCGGGGCAAAGCATGCGGCAGATTAAAAGTTCAAGACAGGTATGTTAGCTATGACTGCATTCCGGCAAACGGTGCTTATGTGGGCTTGGCACCACCAGGAGAGGTCGGATCATGGCAAAGAGAATGCAAGGGTTACCAATTCTGGACCAGATCAGATGAAGATGGCAACTTTGCAGTCGAGAATATAAGGGCTGGTGAATATAATATTTATGCATGGGTCCCTGGTTTCATCGGAGATTACAAATATGATGCTGTCATTAACATTACAGAAGGTTGTACCACTGACGTGGGTGATCTGATATTTGAGCCTCCAAGAGATGGTCCTACATTATGGGAAATAGGCATACCTGATCGCTCTGCAGCAGAGTTTTACATCCCTGACCCTGACCCTATGTACATTAACAGACTTTATGTCAACCATCCTGACAGGTATAGACAGTATGGGTTGTGGGAAAGATATGCTGATCTATATCCAGATGGAGACTTGGTTTTCACAGTTGGCGATAGTGACTACGAAAAAGATTGGTTCTTTGCTCAAGTTAACAGGAAGAAAGAAGATGGTACATATCAAGGAACTACATGGCAAATCAAGTTCATGCTCAATATTGTAGATCATACTGGAACTTATATATTGCGATTGGCCCTAGCAACTGCACATCTTGCTGAATTGCAGGTTCGGATCAATGATCCAAAAGCAGACCCTCCTCTGTTCACAACTGGACAAATTGGGCATGACAACACAATCGCAAGGCATGGAATTCATGGGCTCTACCGGCTTTACAACGTAGATGTGCCGGGAGTTCAGCTTGTGGAAGGGGAAAATATCGTTTTTCTGACACAAGCAATAAACACTGATCCACTTCAGGGTATCATGTATGACTATATAAGGCTAGAATGTCCCCCTTCTAGTTCCAGCAGAAAGCTTTGA
Protein:  
MSWPGVQLLIRDHYVVMDNGILQVTISSPDGIVTGIRYNGIDNVLEVQDEEVERGYWDLVWSKTGSTGTTGTFDVFKGTSFKVVVENEDQVEISFTRTWDFSLEGNVVPLNLDKRFIMLRNSSGFYSYAIFEHLGEWPPFNLPQVRIVFKLRKDKFHYMAVADNRQRFMPLPDDRLPDRGQPLATPEAVLLVNPVEPEFRGEVDDKYQYSSENKDLKVHGWISLNPPIGFWQITPSSEFRSGGPVKQNLTSHVGPYALAMFLSAHYAGEDLVLKLNPGEPWKKVFGPVFLYLNSLWDKDNVYSLWEDAKDQMQMEVQNWPYSFPASEDFPKSDQRGKACGRLKVQDRYVSYDCIPANGAYVGLAPPGEVGSWQRECKGYQFWTRSDEDGNFAVENIRAGEYNIYAWVPGFIGDYKYDAVINITEGCTTDVGDLIFEPPRDGPTLWEIGIPDRSAAEFYIPDPDPMYINRLYVNHPDRYRQYGLWERYADLYPDGDLVFTVGDSDYEKDWFFAQVNRKKEDGTYQGTTWQIKFMLNIVDHTGTYILRLALATAHLAELQVRINDPKADPPLFTTGQIGHDNTIARHGIHGLYRLYNVDVPGVQLVEGENIVFLTQAINTDPLQGIMYDYIRLECPPSSSSRKL